Overview

Dataset statistics

Number of variables31
Number of observations1001
Missing cells429
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory242.6 KiB
Average record size in memory248.1 B

Variable types

NUM17
CAT14

Warnings

situs_state has constant value "1001" Constant
recording_date has a high cardinality: 201 distinct values High cardinality
situs_address has a high cardinality: 962 distinct values High cardinality
situs_city has a high cardinality: 124 distinct values High cardinality
grantor has a high cardinality: 984 distinct values High cardinality
grantee has a high cardinality: 981 distinct values High cardinality
lender has a high cardinality: 213 distinct values High cardinality
borrower has a high cardinality: 814 distinct values High cardinality
grantee_mail_address_raw has a high cardinality: 968 distinct values High cardinality
grantee_mail_city has a high cardinality: 148 distinct values High cardinality
equity_pct is highly correlated with equity and 1 other fieldsHigh correlation
equity is highly correlated with equity_pct and 1 other fieldsHigh correlation
estimated_value_low is highly correlated with estimated_value and 1 other fieldsHigh correlation
estimated_value is highly correlated with estimated_value_low and 1 other fieldsHigh correlation
estimated_value_high is highly correlated with estimated_value and 1 other fieldsHigh correlation
open_loan_amount_1 is highly correlated with equity and 1 other fieldsHigh correlation
lender has 185 (18.5%) missing values Missing
borrower has 185 (18.5%) missing values Missing
grantee_mail_address_raw has 12 (1.2%) missing values Missing
grantee_mail_city has 12 (1.2%) missing values Missing
grantee_mail_state has 12 (1.2%) missing values Missing
grantee_mail_zip has 12 (1.2%) missing values Missing
equity is highly skewed (γ1 = -22.67721679) Skewed
equity_pct is highly skewed (γ1 = -24.22531607) Skewed
open_loan_amount_1 is highly skewed (γ1 = 22.67856163) Skewed
open_loan_amount_2 is highly skewed (γ1 = 29.08717339) Skewed
ratio_sale_avm is highly skewed (γ1 = 22.15535696) Skewed
situs_address is uniformly distributed Uniform
grantor is uniformly distributed Uniform
grantee is uniformly distributed Uniform
borrower is uniformly distributed Uniform
grantee_mail_address_raw is uniformly distributed Uniform
did has unique values Unique
census_tract has 455 (45.5%) zeros Zeros
open_loan_amount_1 has 139 (13.9%) zeros Zeros
open_loan_amount_2 has 747 (74.6%) zeros Zeros
open_loan_amount_3 has 962 (96.1%) zeros Zeros
ratio_sale_tax has 170 (17.0%) zeros Zeros
ratio_sale_avm has 170 (17.0%) zeros Zeros

Reproduction

Analysis started2020-11-12 10:05:08.980488
Analysis finished2020-11-12 10:07:02.370289
Duration1 minute and 53.39 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

pid
Real number (ℝ≥0)

Distinct960
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean737009.6693
Minimum88916
Maximum1405734
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum88916
5-th percentile119584
Q1341173
median733652
Q31150354
95-th percentile1371908
Maximum1405734
Range1316818
Interquartile range (IQR)809181

Descriptive statistics

Standard deviation428135.8032
Coefficient of variation (CV)0.5809093435
Kurtosis-1.426989518
Mean737009.6693
Median Absolute Deviation (MAD)416458
Skewness0.007405954676
Sum737746679
Variance1.833002659e+11
MonotocityIncreasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
88554930.3%
 
119116030.3%
 
115108220.2%
 
12893920.2%
 
27135620.2%
 
48178820.2%
 
47930120.2%
 
133837820.2%
 
121658720.2%
 
56502420.2%
 
Other values (950)97997.8%
 
ValueCountFrequency (%) 
8891610.1%
 
8891710.1%
 
9131010.1%
 
9151810.1%
 
9155810.1%
 
ValueCountFrequency (%) 
140573410.1%
 
140570510.1%
 
140569310.1%
 
140568410.1%
 
140568310.1%
 

did
Real number (ℝ≥0)

UNIQUE

Distinct1001
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean671366762.2
Minimum660225941
Maximum681332938
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum660225941
5-th percentile661454226
Q1666174705
median671825774
Q3676474972
95-th percentile680717005
Maximum681332938
Range21106997
Interquartile range (IQR)10300267

Descriptive statistics

Standard deviation5993058.503
Coefficient of variation (CV)0.008926653567
Kurtosis-1.153785612
Mean671366762.2
Median Absolute Deviation (MAD)5165264
Skewness-0.08632378751
Sum6.72038129e+11
Variance3.591675022e+13
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
66819891110.1%
 
66212523010.1%
 
66820539410.1%
 
66680288710.1%
 
67469565810.1%
 
66876281910.1%
 
67918509010.1%
 
66836755210.1%
 
67967864810.1%
 
66031071110.1%
 
Other values (991)99199.0%
 
ValueCountFrequency (%) 
66022594110.1%
 
66022595610.1%
 
66022659810.1%
 
66022677810.1%
 
66022700410.1%
 
ValueCountFrequency (%) 
68133293810.1%
 
68133291710.1%
 
68126584110.1%
 
68126569410.1%
 
68126506710.1%
 

recording_date
Categorical

HIGH CARDINALITY

Distinct201
Distinct (%)20.1%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
2020-09-18
 
16
2020-05-29
 
15
2020-07-31
 
14
2020-02-28
 
13
2020-08-14
 
13
Other values (196)
930 
ValueCountFrequency (%) 
2020-09-18161.6%
 
2020-05-29151.5%
 
2020-07-31141.4%
 
2020-02-28131.3%
 
2020-08-14131.3%
 
2020-06-26121.2%
 
2020-09-04111.1%
 
2020-07-08111.1%
 
2020-07-09111.1%
 
2020-08-07111.1%
 
Other values (191)87487.3%
 
Frequencies of value counts

Unique

Unique17 ?
Unique (%)1.7%
Histogram of lengths of the category

Length

Max length10
Median length10
Mean length10
Min length10

dt_update
Categorical

Distinct40
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
2020-07-23
204 
2020-08-27
135 
2020-09-24
108 
2020-11-05
54 
2020-04-09
50 
Other values (35)
450 
ValueCountFrequency (%) 
2020-07-2320420.4%
 
2020-08-2713513.5%
 
2020-09-2410810.8%
 
2020-11-05545.4%
 
2020-04-09505.0%
 
2020-10-08414.1%
 
2020-08-20393.9%
 
2020-10-01373.7%
 
2020-10-22353.5%
 
2020-03-12323.2%
 
Other values (30)26626.6%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length10
Median length10
Mean length10
Min length10

qtr
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
2020q3
417 
2020q2
257 
2020q1
239 
2020q4
88 
ValueCountFrequency (%) 
2020q341741.7%
 
2020q225725.7%
 
2020q123923.9%
 
2020q4888.8%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length6
Mean length6
Min length6

fips
Real number (ℝ≥0)

Distinct16
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean53044.72827
Minimum53005
Maximum53077
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum53005
5-th percentile53011
Q153033
median53053
Q353061
95-th percentile53067
Maximum53077
Range72
Interquartile range (IQR)28

Descriptive statistics

Standard deviation18.68320342
Coefficient of variation (CV)0.0003522160265
Kurtosis-0.9079509875
Mean53044.72827
Median Absolute Deviation (MAD)14
Skewness-0.4612784046
Sum53097773
Variance349.0620899
MonotocityNot monotonic
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%) 
5303324624.6%
 
5305318618.6%
 
5306114314.3%
 
5306312212.2%
 
5301110110.1%
 
53067505.0%
 
53035383.8%
 
53077212.1%
 
53057191.9%
 
53021191.9%
 
Other values (6)565.6%
 
ValueCountFrequency (%) 
53005131.3%
 
5300710.1%
 
5301110110.1%
 
53015161.6%
 
53021191.9%
 
ValueCountFrequency (%) 
53077212.1%
 
5307370.7%
 
53067505.0%
 
5306312212.2%
 
5306114314.3%
 

situs_address
Categorical

HIGH CARDINALITY
UNIFORM

Distinct962
Distinct (%)96.1%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
2319 E Rowan Ave
 
3
2701 H St
 
3
6100 E Evergreen Blvd
 
2
5106 164th St Sw
 
2
433 Commercial Ave
 
2
Other values (957)
989 
ValueCountFrequency (%) 
2319 E Rowan Ave30.3%
 
2701 H St30.3%
 
6100 E Evergreen Blvd20.2%
 
5106 164th St Sw20.2%
 
433 Commercial Ave20.2%
 
2609 34th Ave W20.2%
 
2711 Unander Ave20.2%
 
3627 Preble St20.2%
 
19512 SE 32nd Dr20.2%
 
815 26th Ave S20.2%
 
Other values (952)97997.8%
 
Frequencies of value counts

Unique

Unique925 ?
Unique (%)92.4%
Histogram of lengths of the category

Length

Max length30
Median length16
Mean length16.41158841
Min length9

situs_city
Categorical

HIGH CARDINALITY

Distinct124
Distinct (%)12.4%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
Tacoma
96 
Spokane
85 
Seattle
83 
Vancouver
75 
Everett
 
31
Other values (119)
631 
ValueCountFrequency (%) 
Tacoma969.6%
 
Spokane858.5%
 
Seattle838.3%
 
Vancouver757.5%
 
Everett313.1%
 
Puyallup272.7%
 
Olympia262.6%
 
Spanaway252.5%
 
Spokane Valley232.3%
 
Marysville191.9%
 
Other values (114)51151.0%
 
Frequencies of value counts

Unique

Unique33 ?
Unique (%)3.3%
Histogram of lengths of the category

Length

Max length17
Median length7
Mean length7.92007992
Min length3

situs_state
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
WA
1001 
ValueCountFrequency (%) 
WA1001100.0%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

situs_zip
Real number (ℝ≥0)

Distinct217
Distinct (%)21.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean98464.32068
Minimum98001
Maximum99354
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum98001
5-th percentile98023
Q198168
median98375
Q398662
95-th percentile99216
Maximum99354
Range1353
Interquartile range (IQR)494

Descriptive statistics

Standard deviation387.6511035
Coefficient of variation (CV)0.003936970274
Kurtosis-0.304739379
Mean98464.32068
Median Absolute Deviation (MAD)251
Skewness0.8763773306
Sum98562785
Variance150273.3781
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
99207252.5%
 
98387252.5%
 
99205202.0%
 
98404202.0%
 
98409202.0%
 
99301171.7%
 
98444141.4%
 
98026141.4%
 
98198141.4%
 
98201131.3%
 
Other values (207)81981.8%
 
ValueCountFrequency (%) 
9800190.9%
 
9800230.3%
 
9800340.4%
 
9800410.1%
 
9800630.3%
 
ValueCountFrequency (%) 
9935430.3%
 
9935310.1%
 
9935220.2%
 
9934910.1%
 
9933810.1%
 

census_tract
Real number (ℝ≥0)

ZEROS

Distinct267
Distinct (%)26.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25558.64935
Minimum0
Maximum96110
Zeros455
Zeros (%)45.5%
Memory size7.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1300
Q353509
95-th percentile90201
Maximum96110
Range96110
Interquartile range (IQR)53509

Descriptive statistics

Standard deviation32460.26296
Coefficient of variation (CV)1.270030451
Kurtosis-1.07470747
Mean25558.64935
Median Absolute Deviation (MAD)1300
Skewness0.7451539691
Sum25584208
Variance1053668672
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
045545.5%
 
94000121.2%
 
63300101.0%
 
6290090.9%
 
7311490.9%
 
70080.8%
 
150080.8%
 
20070.7%
 
7312570.7%
 
7040360.6%
 
Other values (257)47047.0%
 
ValueCountFrequency (%) 
045545.5%
 
10010.1%
 
20070.7%
 
30030.3%
 
40020.2%
 
ValueCountFrequency (%) 
9611020.2%
 
9609020.2%
 
9608020.2%
 
9604010.1%
 
9603010.1%
 
Distinct8
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
0
455 
1
176 
2
149 
3
127 
4
56 
Other values (3)
 
38
ValueCountFrequency (%) 
045545.5%
 
117617.6%
 
214914.9%
 
312712.7%
 
4565.6%
 
5282.8%
 
690.9%
 
710.1%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)0.1%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

grantor
Categorical

HIGH CARDINALITY
UNIFORM

Distinct984
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
Bank of America NA
 
5
Flyhomes Investments Wa LLC
 
3
Dawn M Loftis
 
2
Bt Property Investments LLC
 
2
Rainier General Dev Inc
 
2
Other values (979)
987 
ValueCountFrequency (%) 
Bank of America NA50.5%
 
Flyhomes Investments Wa LLC30.3%
 
Dawn M Loftis20.2%
 
Bt Property Investments LLC20.2%
 
Rainier General Dev Inc20.2%
 
National Residl Nominee SVCS20.2%
 
Brandon M Minga20.2%
 
Dorean Properties LLC20.2%
 
Cory S Colvin20.2%
 
Meiwan Zou20.2%
 
Other values (974)97797.6%
 
Frequencies of value counts

Unique

Unique971 ?
Unique (%)97.0%
Histogram of lengths of the category

Length

Max length52
Median length18
Mean length19.73326673
Min length6

grantee
Categorical

HIGH CARDINALITY
UNIFORM

Distinct981
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
Ross Road 97 Invs PTRS LLC
 
4
Ih6 Property Washington LP
 
4
Ginn Group LLC
 
3
Flyhomes Investments Wa LLC
 
3
Secretary of Hsng & Urban Dev
 
2
Other values (976)
985 
ValueCountFrequency (%) 
Ross Road 97 Invs PTRS LLC40.4%
 
Ih6 Property Washington LP40.4%
 
Ginn Group LLC30.3%
 
Flyhomes Investments Wa LLC30.3%
 
Secretary of Hsng & Urban Dev20.2%
 
Wilmington Trust NA 2015-1 TR20.2%
 
Bp Family Booth Trust20.2%
 
Repair Pro Wa LLC20.2%
 
National Residl Nominee SVCS20.2%
 
Matthew & Savanna Owens20.2%
 
Other values (971)97597.4%
 
Frequencies of value counts

Unique

Unique967 ?
Unique (%)96.6%
Histogram of lengths of the category

Length

Max length52
Median length23
Mean length23.27072927
Min length8

lender
Categorical

HIGH CARDINALITY
MISSING

Distinct213
Distinct (%)26.1%
Missing185
Missing (%)18.5%
Memory size7.8 KiB
Fairway Independent MTG
 
56
Caliber Hm Loans
 
54
Guild MTG
 
30
Evergreen Moneysource MTG
 
29
Homebridge Fin'l SVCS Inc
 
20
Other values (208)
627 
ValueCountFrequency (%) 
Fairway Independent MTG565.6%
 
Caliber Hm Loans545.4%
 
Guild MTG303.0%
 
Evergreen Moneysource MTG292.9%
 
Homebridge Fin'l SVCS Inc202.0%
 
Umpqua BK191.9%
 
Movement MTG181.8%
 
Fairway Independent Mtg171.7%
 
Crosscountry MTG171.7%
 
Wells Fargo BK151.5%
 
Other values (203)54154.0%
 
(Missing)18518.5%
 
Frequencies of value counts

Unique

Unique118 ?
Unique (%)14.5%
Histogram of lengths of the category

Length

Max length38
Median length14
Mean length13.48651349
Min length3

borrower
Categorical

HIGH CARDINALITY
MISSING
UNIFORM

Distinct814
Distinct (%)99.8%
Missing185
Missing (%)18.5%
Memory size7.8 KiB
Matthew & Savanna Owens
 
2
Twin Estates LLC
 
2
Ariel Garcia | Calvin Jr Lindsay
 
1
Christine L Razee
 
1
Raymond L & Nicole L Guerrero
 
1
Other values (809)
809 
ValueCountFrequency (%) 
Matthew & Savanna Owens20.2%
 
Twin Estates LLC20.2%
 
Ariel Garcia | Calvin Jr Lindsay10.1%
 
Christine L Razee10.1%
 
Raymond L & Nicole L Guerrero10.1%
 
Brian & Erlinda A Tuskan10.1%
 
Genolinsky M & Rowena R Lopez10.1%
 
Van V & Joe Cao | Weijun Ouyang10.1%
 
Travis Betts10.1%
 
William A & Nicole L Frank10.1%
 
Other values (804)80480.3%
 
(Missing)18518.5%
 
Frequencies of value counts

Unique

Unique812 ?
Unique (%)99.5%
Histogram of lengths of the category

Length

Max length52
Median length21
Mean length19.93206793
Min length3

grantee_mail_address_raw
Categorical

HIGH CARDINALITY
MISSING
UNIFORM

Distinct968
Distinct (%)97.9%
Missing12
Missing (%)1.2%
Memory size7.8 KiB
1717 MAIN ST #2000 DALLAS TX 752014657
 
4
400 N 34TH ST #300 SEATTLE WA 981038600
 
4
1201 WESTERN AVE #100 SEATTLE WA 981012953
 
3
7223 NE HAZEL DELL AVE VANCOUVER WA 986658326
 
3
32127 E MORRISON ST CARNATION WA 980145057
 
2
Other values (963)
973 
ValueCountFrequency (%) 
1717 MAIN ST #2000 DALLAS TX 75201465740.4%
 
400 N 34TH ST #300 SEATTLE WA 98103860040.4%
 
1201 WESTERN AVE #100 SEATTLE WA 98101295330.3%
 
7223 NE HAZEL DELL AVE VANCOUVER WA 98665832630.3%
 
32127 E MORRISON ST CARNATION WA 98014505720.2%
 
706 W GARLAND AVE SPOKANE WA 99205295920.2%
 
201 N 30TH ST MOUNT VERNON WA 98273367120.2%
 
5324 BELLAIRE AVE VALLEY VILLAGE CA 91607233020.2%
 
11632 74TH AVE S SEATTLE WA 98178300920.2%
 
19512 SE 32ND DR CAMAS WA 98607944720.2%
 
Other values (958)96396.2%
 
(Missing)121.2%
 
Frequencies of value counts

Unique

Unique953 ?
Unique (%)96.4%
Histogram of lengths of the category

Length

Max length57
Median length38
Mean length38.0969031
Min length3

grantee_mail_city
Categorical

HIGH CARDINALITY
MISSING

Distinct148
Distinct (%)15.0%
Missing12
Missing (%)1.2%
Memory size7.8 KiB
SEATTLE
95 
TACOMA
82 
SPOKANE
76 
VANCOUVER
68 
EVERETT
 
29
Other values (143)
639 
ValueCountFrequency (%) 
SEATTLE959.5%
 
TACOMA828.2%
 
SPOKANE767.6%
 
VANCOUVER686.8%
 
EVERETT292.9%
 
PUYALLUP262.6%
 
OLYMPIA252.5%
 
SPANAWAY222.2%
 
SPOKANE VALLEY212.1%
 
PASCO171.7%
 
Other values (138)52852.7%
 
Frequencies of value counts

Unique

Unique53 ?
Unique (%)5.4%
Histogram of lengths of the category

Length

Max length17
Median length7
Mean length7.946053946
Min length3

grantee_mail_state
Categorical

MISSING

Distinct14
Distinct (%)1.4%
Missing12
Missing (%)1.2%
Memory size7.8 KiB
WA
958 
CA
 
8
TX
 
6
AZ
 
3
OR
 
3
Other values (9)
 
11
ValueCountFrequency (%) 
WA95895.7%
 
CA80.8%
 
TX60.6%
 
AZ30.3%
 
OR30.3%
 
FL20.2%
 
ID20.2%
 
KS10.1%
 
OK10.1%
 
MA10.1%
 
Other values (4)40.4%
 
(Missing)121.2%
 
Frequencies of value counts

Unique

Unique7 ?
Unique (%)0.7%
Histogram of lengths of the category

Length

Max length3
Median length2
Mean length2.011988012
Min length2

grantee_mail_zip
Real number (ℝ≥0)

MISSING

Distinct250
Distinct (%)25.3%
Missing12
Missing (%)1.2%
Infinite0
Infinite (%)0.0%
Mean97784.77958
Minimum1089
Maximum99354
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum1089
5-th percentile98006
Q198109
median98370
Q398632
95-th percentile99212
Maximum99354
Range98265
Interquartile range (IQR)523

Descriptive statistics

Standard deviation5683.751917
Coefficient of variation (CV)0.05812511867
Kurtosis176.6087417
Mean97784.77958
Median Absolute Deviation (MAD)262
Skewness-12.38259215
Sum96709147
Variance32305035.86
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
98387222.2%
 
99207212.1%
 
99205191.9%
 
99301171.7%
 
98409171.7%
 
98404141.4%
 
98374121.2%
 
98106121.2%
 
98026121.2%
 
98258121.2%
 
Other values (240)83183.0%
 
ValueCountFrequency (%) 
108910.1%
 
1001910.1%
 
3225610.1%
 
3468910.1%
 
6705810.1%
 
ValueCountFrequency (%) 
9935430.3%
 
9935310.1%
 
9935220.2%
 
9935010.1%
 
9933810.1%
 

equity
Real number (ℝ)

HIGH CORRELATION
SKEWED

Distinct862
Distinct (%)86.2%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean-2177929.927
Minimum-1300077000
Maximum2215000
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum-1300077000
5-th percentile16050
Q186959.75
median144370
Q3245420.5
95-th percentile515170
Maximum2215000
Range1302292000
Interquartile range (IQR)158460.75

Descriptive statistics

Standard deviation52965185.67
Coefficient of variation (CV)-24.31904949
Kurtosis518.4077116
Mean-2177929.927
Median Absolute Deviation (MAD)72355
Skewness-22.67721679
Sum-2177929927
Variance2.805310893e+15
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
10900050.5%
 
21100040.4%
 
26700040.4%
 
18700040.4%
 
16200040.4%
 
13500040.4%
 
5500030.3%
 
46200030.3%
 
23600030.3%
 
23700030.3%
 
Other values (852)96396.2%
 
ValueCountFrequency (%) 
-130007700010.1%
 
-105681000010.1%
 
-1039700010.1%
 
-254400010.1%
 
-186859910.1%
 
ValueCountFrequency (%) 
221500010.1%
 
188800010.1%
 
168043210.1%
 
147900010.1%
 
133600010.1%
 

equity_pct
Real number (ℝ)

HIGH CORRELATION
SKEWED

Distinct874
Distinct (%)87.4%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean-6.385185376
Minimum-4248.62
Maximum1
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum-4248.62
5-th percentile0.04777772
Q10.27598
median0.4018405
Q30.575424
95-th percentile1
Maximum1
Range4249.62
Interquartile range (IQR)0.299444

Descriptive statistics

Standard deviation156.5884261
Coefficient of variation (CV)-24.5237087
Kurtosis609.2031152
Mean-6.385185376
Median Absolute Deviation (MAD)0.143695
Skewness-24.22531607
Sum-6385.185376
Variance24519.9352
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
111411.4%
 
0.25177320.2%
 
0.3574320.2%
 
0.2759820.2%
 
0.4189620.2%
 
0.49491520.2%
 
0.79296120.2%
 
0.33333320.2%
 
0.10614120.2%
 
0.55462220.2%
 
Other values (864)86886.7%
 
ValueCountFrequency (%) 
-4248.6210.1%
 
-2546.5310.1%
 
-15.681710.1%
 
-7.1460710.1%
 
-4.0888410.1%
 
ValueCountFrequency (%) 
111411.4%
 
0.99557910.1%
 
0.99014710.1%
 
0.95540710.1%
 
0.95231910.1%
 

estimated_value
Real number (ℝ≥0)

HIGH CORRELATION

Distinct505
Distinct (%)50.5%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean433215
Minimum103000
Maximum2928000
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum103000
5-th percentile181000
Q1282000
median360000
Q3500000
95-th percentile864400
Maximum2928000
Range2825000
Interquartile range (IQR)218000

Descriptive statistics

Standard deviation270037.1397
Coefficient of variation (CV)0.623332848
Kurtosis20.14808685
Mean433215
Median Absolute Deviation (MAD)99000
Skewness3.506902995
Sum433215000
Variance7.292005683e+10
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
31600080.8%
 
29600080.8%
 
31300080.8%
 
25900070.7%
 
32000070.7%
 
35400070.7%
 
34700070.7%
 
32600070.7%
 
28200060.6%
 
28100060.6%
 
Other values (495)92992.8%
 
ValueCountFrequency (%) 
10300020.2%
 
11300010.1%
 
11400010.1%
 
11600010.1%
 
12400010.1%
 
ValueCountFrequency (%) 
292800010.1%
 
270500010.1%
 
222900010.1%
 
215200010.1%
 
201500010.1%
 

estimated_value_low
Real number (ℝ≥0)

HIGH CORRELATION

Distinct479
Distinct (%)47.9%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean358481
Minimum74000
Maximum2238000
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum74000
5-th percentile137000
Q1232000
median307000
Q3421000
95-th percentile700050
Maximum2238000
Range2164000
Interquartile range (IQR)189000

Descriptive statistics

Standard deviation217016.8518
Coefficient of variation (CV)0.6053789511
Kurtosis16.46734417
Mean358481
Median Absolute Deviation (MAD)88000
Skewness3.087761106
Sum358481000
Variance4.709631395e+10
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
28400080.8%
 
21300080.8%
 
29000070.7%
 
29400070.7%
 
30100070.7%
 
27700070.7%
 
27500070.7%
 
29800070.7%
 
24000070.7%
 
35200060.6%
 
Other values (469)92992.8%
 
ValueCountFrequency (%) 
7400010.1%
 
7700010.1%
 
8000010.1%
 
8200010.1%
 
8700010.1%
 
ValueCountFrequency (%) 
223800010.1%
 
220100010.1%
 
163600010.1%
 
158500010.1%
 
157700010.1%
 

estimated_value_high
Real number (ℝ≥0)

HIGH CORRELATION

Distinct546
Distinct (%)54.6%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean507951
Minimum126000
Maximum3655000
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum126000
5-th percentile222000
Q1330000
median414500
Q3581000
95-th percentile1028950
Maximum3655000
Range3529000
Interquartile range (IQR)251000

Descriptive statistics

Standard deviation325733.0481
Coefficient of variation (CV)0.6412686422
Kurtosis22.93952167
Mean507951
Median Absolute Deviation (MAD)109500
Skewness3.790656648
Sum507951000
Variance1.061020186e+11
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
44800090.9%
 
32300080.8%
 
47200060.6%
 
34600060.6%
 
38900060.6%
 
38600060.6%
 
29700060.6%
 
57300050.5%
 
41300050.5%
 
37100050.5%
 
Other values (536)93893.7%
 
ValueCountFrequency (%) 
12600010.1%
 
13200010.1%
 
14500010.1%
 
14600010.1%
 
15000010.1%
 
ValueCountFrequency (%) 
365500010.1%
 
317200010.1%
 
293000010.1%
 
271900010.1%
 
239400010.1%
 

open_loan_amount_1
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct729
Distinct (%)72.9%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean2572099.939
Minimum0
Maximum1300383000
Zeros139
Zeros (%)13.9%
Memory size7.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1116982.75
median194860
Q3284357.5
95-th percentile495000
Maximum1300383000
Range1300383000
Interquartile range (IQR)167374.75

Descriptive statistics

Standard deviation52962161.32
Coefficient of variation (CV)20.59102001
Kurtosis518.4406483
Mean2572099.939
Median Absolute Deviation (MAD)83710
Skewness22.67856163
Sum2572099939
Variance2.804990531e+15
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
013913.9%
 
41700080.8%
 
19000050.5%
 
16000050.5%
 
15000050.5%
 
28500050.5%
 
28800050.5%
 
14400040.4%
 
18000040.4%
 
16400040.4%
 
Other values (719)81681.5%
 
ValueCountFrequency (%) 
013913.9%
 
2000010.1%
 
2257510.1%
 
3450010.1%
 
3650010.1%
 
ValueCountFrequency (%) 
130038300010.1%
 
105722500010.1%
 
232559910.1%
 
165000010.1%
 
161000010.1%
 

open_loan_amount_2
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct179
Distinct (%)17.9%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean35196.167
Minimum0
Maximum11060000
Zeros747
Zeros (%)74.6%
Memory size7.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31795.25
95-th percentile137880.85
Maximum11060000
Range11060000
Interquartile range (IQR)1795.25

Descriptive statistics

Standard deviation359388.9477
Coefficient of variation (CV)10.21102519
Kurtosis889.3066603
Mean35196.167
Median Absolute Deviation (MAD)0
Skewness29.08717339
Sum35196167
Variance1.291604157e+11
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
074774.6%
 
50000171.7%
 
100000151.5%
 
2500060.6%
 
2000060.6%
 
7500050.5%
 
20000040.4%
 
3500030.3%
 
1000030.3%
 
11000030.3%
 
Other values (169)19119.1%
 
ValueCountFrequency (%) 
074774.6%
 
120.2%
 
154310.1%
 
255210.1%
 
330010.1%
 
ValueCountFrequency (%) 
1106000010.1%
 
170000010.1%
 
83500010.1%
 
72800010.1%
 
50000010.1%
 

open_loan_amount_3
Real number (ℝ≥0)

ZEROS

Distinct39
Distinct (%)3.9%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean3848.821
Minimum0
Maximum375000
Zeros962
Zeros (%)96.1%
Memory size7.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum375000
Range375000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation26767.77557
Coefficient of variation (CV)6.954798773
Kurtosis101.9618684
Mean3848.821
Median Absolute Deviation (MAD)0
Skewness9.398289706
Sum3848821
Variance716513808.9
MonotocityNot monotonic
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%) 
096296.1%
 
10000010.1%
 
832610.1%
 
6415010.1%
 
15880010.1%
 
6570010.1%
 
5000110.1%
 
2000010.1%
 
5000010.1%
 
8950010.1%
 
Other values (29)292.9%
 
ValueCountFrequency (%) 
096296.1%
 
743310.1%
 
812810.1%
 
832610.1%
 
908310.1%
 
ValueCountFrequency (%) 
37500010.1%
 
37000010.1%
 
25060010.1%
 
25000010.1%
 
24000010.1%
 

ratio_sale_tax
Real number (ℝ≥0)

ZEROS

Distinct814
Distinct (%)81.4%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean1.171191565
Minimum0
Maximum37.628
Zeros170
Zeros (%)17.0%
Memory size7.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10.966265
median1.25177
Q31.40812
95-th percentile1.8300555
Maximum37.628
Range37.628
Interquartile range (IQR)0.441855

Descriptive statistics

Standard deviation1.521617291
Coefficient of variation (CV)1.299204448
Kurtosis360.3089051
Mean1.171191565
Median Absolute Deviation (MAD)0.198555
Skewness16.58444876
Sum1171.191565
Variance2.315319179
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
017017.0%
 
1.3946620.2%
 
1.3888920.2%
 
1.1075520.2%
 
1.1904820.2%
 
1.0040220.2%
 
1.2852920.2%
 
1.3358120.2%
 
1.1155820.2%
 
1.3849420.2%
 
Other values (804)81281.1%
 
ValueCountFrequency (%) 
017017.0%
 
0.013033910.1%
 
0.019544810.1%
 
0.021370710.1%
 
0.022177110.1%
 
ValueCountFrequency (%) 
37.62810.1%
 
18.597710.1%
 
17.650310.1%
 
5.9882610.1%
 
5.9096710.1%
 

ratio_sale_avm
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct779
Distinct (%)77.9%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean0.8677135854
Minimum0
Maximum29.9095
Zeros170
Zeros (%)17.0%
Memory size7.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10.7456355
median0.994285
Q31.07325
95-th percentile1.301014
Maximum29.9095
Range29.9095
Interquartile range (IQR)0.3276145

Descriptive statistics

Standard deviation1.035829208
Coefficient of variation (CV)1.193745523
Kurtosis619.8855017
Mean0.8677135854
Median Absolute Deviation (MAD)0.111135
Skewness22.15535696
Sum867.7135854
Variance1.072942147
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
017017.0%
 
1101.0%
 
1.0344840.4%
 
1.0204140.4%
 
1.0294130.3%
 
1.1111130.3%
 
1.0520.2%
 
1.02520.2%
 
0.90517220.2%
 
1.0330620.2%
 
Other values (769)79879.7%
 
ValueCountFrequency (%) 
017017.0%
 
0.010253310.1%
 
0.014911210.1%
 
0.015228510.1%
 
0.016391210.1%
 
ValueCountFrequency (%) 
29.909510.1%
 
4.9068210.1%
 
4.8534910.1%
 
3.4638610.1%
 
3.1830910.1%
 

square_footage
Real number (ℝ≥0)

Distinct628
Distinct (%)62.8%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean1588.479
Minimum504
Maximum13787
Zeros0
Zeros (%)0.0%
Memory size7.8 KiB

Quantile statistics

Minimum504
5-th percentile803.8
Q11121.75
median1391
Q31840
95-th percentile2849.75
Maximum13787
Range13283
Interquartile range (IQR)718.25

Descriptive statistics

Standard deviation873.0548404
Coefficient of variation (CV)0.5496168601
Kurtosis76.93192069
Mean1588.479
Median Absolute Deviation (MAD)339.5
Skewness6.328318678
Sum1588479
Variance762224.7543
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1140101.0%
 
116090.9%
 
96070.7%
 
152060.6%
 
76860.6%
 
84060.6%
 
120060.6%
 
124060.6%
 
128060.6%
 
126060.6%
 
Other values (618)93293.1%
 
ValueCountFrequency (%) 
50410.1%
 
54020.2%
 
57610.1%
 
59010.1%
 
59810.1%
 
ValueCountFrequency (%) 
1378720.2%
 
555010.1%
 
548010.1%
 
546410.1%
 
514010.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Missing values

Sample

First rows

piddidrecording_datedt_updateqtrfipssitus_addresssitus_citysitus_statesitus_zipcensus_tractcensus_block_groupgrantorgranteelenderborrowergrantee_mail_address_rawgrantee_mail_citygrantee_mail_stategrantee_mail_zipequityequity_pctestimated_valueestimated_value_lowestimated_value_highopen_loan_amount_1open_loan_amount_2open_loan_amount_3ratio_sale_taxratio_sale_avmsquare_footage
0889166688491432020-05-262020-07-232020q253063421 S Custer RdSpokane ValleyWA99212123004Tyler & Saralynn DilgGuy Henne | Erica A WhitmarshGuild MTGGuy Henne | Erica A Whitmarsh421 S CUSTER RD SPOKANE VALLEY WA 992120324SPOKANE VALLEYWA99212.0113000.00.551220205000.0158000.0253000.092000.00.00.01.610251.097561898.0
1889176712004002020-06-232020-07-232020q25306311109 E Springfield AveSpokane ValleyWA99206119002Lana LenerizJoshua L & Shannan G PyleLOANDEPOT.COM LLCJoshua L & Shannan G Pyle11109 E SPRINGFIELD AVE SPOKANE VALLEY WA 992066225SPOKANE VALLEYWA99206.0145250.00.600207242000.0199000.0285000.096750.00.00.01.958771.384091233.0
2913106666619202020-04-272020-05-142020q2530336828 S 116th PlSeattleWA9817800Adisu FantaHugo Velazquez | Krystal BautistaQuicken LnsHugo Velazquez | Krystal Bautista6828 S 116TH PL TUKWILA WA 981783025TUKWILAWA98178.0159000.00.362187439000.0376000.0502000.0280000.00.00.01.295261.059231450.0
3915186682202642020-05-262020-06-112020q25301121811 NE 28th StCamasWA9860700Scott J CurrieMarc & Valerie V MouserPenrith Hm LnsMarc & Valerie V Mouser21811 NE 28TH ST CAMAS WA 986079298CAMASWA98607.071680.00.188136381000.0307000.0455000.0309320.00.00.00.000000.000001156.0
4915586780527672020-09-142020-10-012020q35306119926 York RdBothellWA98012519242David L & Victoria McNeelySnohomish CountyNaNNaN3000 ROCKEFELLER AVE EVERETT WA 982014071EVERETTWA98201.0251500.00.528361476000.0399000.0553000.0224500.00.00.00.000000.000001236.0
5930186625762872020-02-052020-08-272020q153021519 S Douglas AvePascoWA9930100Joseph B ZilarDustin P Nelson | Yessenia MendozaHomebridge Fin'l SVCS IncDustin P Nelson | Yessenia Mendoza519 S DOUGLAS AVE PASCO WA 993014412PASCOWA99301.0103393.00.469968220000.0179000.0260000.0116607.00.00.01.881341.045001116.0
6934536807176932020-10-132020-11-052020q45306716266 Old Highway 99 SeTeninoWA9858900Rainier General Dev IncJohn RobinsonNaNNaN320 BRIAR LN S TENINO WA 98589TENINOWA98589.0319000.01.000000319000.0240000.0399000.00.00.00.00.000000.0000013787.0
7934536813329382020-10-202020-11-052020q45306716266 Old Highway 99 SeTeninoWA9858900Rainier General Dev IncWilliam A & Nicole L FrankCaliber Hm LoansWilliam A & Nicole L Frank381 BRIAR LN S TENINO WA 98589TENINOWA98589.0319000.01.000000319000.0240000.0399000.00.00.00.00.000000.0000013787.0
8939326810396192020-10-202020-11-052020q45303526039 Leyman Ln NeKingstonWA98346901021Harvey WolffElray & Sharon KonkelWashington Fed'l BKElray & Sharon Konkel26039 LEYMAN LN NE KINGSTON WA 983469421KINGSTONWA98346.0912000.01.000000912000.0704000.01120000.00.00.00.00.000000.000003654.0
9939526796794062020-10-022020-10-222020q4530534825 S D StTacomaWA98408624004Stephen P CoxAngelic Properties LLCEastside FNDGAngelic Properties LLC4825 S D ST TACOMA WA 984086511TACOMAWA98408.0179000.00.566456316000.0245000.0386000.0109600.027400.00.00.000000.000001163.0

Last rows

piddidrecording_datedt_updateqtrfipssitus_addresssitus_citysitus_statesitus_zipcensus_tractcensus_block_groupgrantorgranteelenderborrowergrantee_mail_address_rawgrantee_mail_citygrantee_mail_stategrantee_mail_zipequityequity_pctestimated_valueestimated_value_lowestimated_value_highopen_loan_amount_1open_loan_amount_2open_loan_amount_3ratio_sale_taxratio_sale_avmsquare_footage
99114056136780095732020-09-092020-10-012020q3530539311 62nd Ave EPuyallupWA98371712052Erica R BrancheJose A G Soto | Daisy G GomezFairway Independent MTGJose A G Soto | Daisy G Gomez9311 62ND AVE E PUYALLUP WA 983716251PUYALLUPWA98371.0125400.00.303632413000.0361000.0465000.0287600.00.00.00.0000000.0000001645.0
99214056266727680562020-07-162020-08-272020q3530538802 63rd Ave EPuyallupWA98371712052Carrie M RuttAustin G AndalParamount Resid'l MTGAustin G Andal8802 63RD AVE E PUYALLUP WA 983716227PUYALLUPWA98371.0131000.00.370057354000.0305000.0403000.0223000.00.00.01.4230201.1384201205.0
99314056376630934192020-02-122020-04-092020q1530536302 90th Street Ct EPuyallupWA98371712052Adam WilliamsWiktoria A SrebroEvergreen Moneysource MtgWiktoria A Srebro6302 90TH STREET CT E PUYALLUP WA 983716281PUYALLUPWA98371.03450.00.009914348000.0307000.0389000.0313550.031000.00.01.3508801.1063201357.0
99414056696729232932020-07-212020-08-272020q3530533415 E L StTacomaWA98404620001Peter F & Cherie L SuskiBrandon L Che | Amanda CastanonOn Q Fin'lBrandon L Che | Amanda Castanon3415 E L ST TACOMA WA 984043924TACOMAWA98404.0161958.00.457508354000.0293000.0415000.0192042.00.00.01.4010701.0381401684.0
99514056806640318962020-03-022020-04-092020q1530535212 S Pine StTacomaWA98409629002Woolery T & A L/TRPatricia L SimanekBay EquityPatricia L Simanek5212 S PINE ST TACOMA WA 984096345TACOMAWA98409.0163000.00.610487267000.0210000.0323000.0104000.00.00.01.4136501.0393301330.0
99614056836791850902020-09-232020-10-152020q3530535602 S Oakes StTacomaWA98409630001Bill StreepyMilton D & Patricia A HardyVeterans United Hm LNSMilton D & Patricia A Hardy5602 S OAKES ST TACOMA WA 984096212TACOMAWA98409.023224.00.067316345000.0277000.0412000.0321776.00.00.00.0000000.0000001532.0
99714056846635717222020-02-212020-04-092020q1530535619 S Clement AveTacomaWA98409629003Brandon A AndrewsMitchell A Horn | Michelle L KonikowAmerican Pacific MtgMitchell A Horn | Michelle L Konikow5619 S S CLEMENT AVE TACOMA WA 98409TACOMAWA98409.0152859.00.434259352000.0275000.0428000.0199141.00.00.01.1107500.9659091520.0
99814056936701405912020-06-122020-07-232020q2530535636 S Cedar StTacomaWA98409629003Payton D KempBrian & Erlinda A TuskanBay EquityBrian & Erlinda A Tuskan15024 126TH AVE NE WOODINVILLE WA 980724670WOODINVILLEWA98072.027437.00.102377268000.0219000.0318000.0240563.00.00.01.4787801.2873101224.0
99914057056643370652020-03-032020-04-092020q1530532210 178th St ETacomaWA98445714061T Van TungKevin J Gonzales | Chiennsry K V AlcanceGuaranteed RateKevin J Gonzales | Chiennsry K V Alcance2210 178TH ST E TACOMA WA 984454212TACOMAWA98445.030000.00.088235340000.0309000.0370000.0176000.044000.090000.01.2895301.0794102076.0
100014057346740516402020-08-032020-08-272020q3530539828 Patterson St STacomaWA98444717051Korrissa A CustardJoseph C & Chelsea C BaldwinRoy & Patricia HellandJoseph C & Chelsea C Baldwin908 73RD ST E TACOMA WA 984045509TACOMAWA98404.0109600.00.397101276000.0217000.0336000.0166400.00.00.00.9103020.7427541064.0